Modeling of audiovisual speech perception in noise
Abstract
We present three models of audiovisual speech perception at varying signal-to-noise ratios (SNR). The first model is Massaro’s Fuzzy Logical Model of Perception (FLMP), fitted separately at each SNR. The second model adds the constraint that the visual response probabilities are the same regardless of the SNR. Both models describe the data well. The Root Mean Squared Error (RMSE), corrected for the number of degrees of freedom, was smaller for the second model. Consistent with this, a cross-validated paired t-test showed that the second model was significantly better at predicting individual performance despite having fewer parameters. In a third model, a weighted FLMP, the SNR is parameterized, which reduces the number of free parameters substantially. This model fits the data significantly worse than the other two models, but it does capture salient features of the change in performance with varying SNR.
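As a rough illustration of the first model, the standard FLMP rule multiplies the auditory and visual support for each response category and normalizes over categories. The sketch below shows that rule together with one common form of a degrees-of-freedom-corrected RMSE, sqrt(SSE / (N - k)). The abstract does not state which correction the authors used, so that formula, the function names, and the example values are all assumptions made for illustration.

```python
import math


def flmp(auditory, visual):
    """Standard FLMP combination rule: multiply the auditory and visual
    support for each response category, then normalize over categories."""
    if len(auditory) != len(visual):
        raise ValueError("auditory and visual must have the same length")
    products = [a * v for a, v in zip(auditory, visual)]
    total = sum(products)
    if total == 0:
        raise ValueError("at least one category needs non-zero support")
    return [p / total for p in products]


def corrected_rmse(observed, predicted, n_free_params):
    """RMSE with a degrees-of-freedom correction, sqrt(SSE / (N - k)).
    ASSUMPTION: this is one common form, not necessarily the paper's."""
    n = len(observed)
    if n <= n_free_params:
        raise ValueError("need more data points than free parameters")
    sse = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    return math.sqrt(sse / (n - n_free_params))


# Hypothetical support values for two response categories.
audiovisual = flmp([0.6, 0.4], [0.9, 0.1])
```

In the second model described above, the `visual` vector would be shared across all SNRs while the `auditory` vector changes with the SNR, which is where the reduction in free parameters comes from.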
Similar resources
Reliability of Interaural Time Difference-Based Localization Training in Elderly Individuals with Speech-in-Noise Perception Disorder
Background: Previous studies have shown that interaural-time-difference (ITD) training can improve localization ability. Surprisingly little is, however, known about localization training vis-à-vis speech perception in noise based on interaural time difference in the envelope (ITD ENV). We sought to investigate the reliability of an ITD ENV-based training program in speech-in-noise perception a...
Effect of signal to noise ratio on the speech perception ability of older adults
Background: Speech perception ability depends on auditory and extra-auditory elements. The signal-to-noise ratio (SNR) is an extra-auditory element that has an effect on the ability to normally follow speech and maintain a conversation. Speech in noise perception difficulty is a common complaint of the elderly. In this study, the importance of SNR magnitude as an extra-auditory effect on speech...
Envelope-based inter-aural time difference localization training to improve speech-in-noise perception in the elderly
Background: Many elderly individuals complain of difficulty in understanding speech in noise despite having normal hearing thresholds. According to previous studies, auditory training leads to improvement in speech-in-noise perception, but these studies did not consider the etiology, so their results cannot be generalized. The present study aimed at investigating the effectiveness of envelope-b...
Audiovisual processing of Lombard speech
Perception results are presented that address the role of Lombard speech in auditory and audiovisual speech perception. Basically, visual enhancement neutralizes the advantage of Lombard speech observed for auditory perception. It remains an open question whether or not Lombard speech is preferable for perception studies of speech in noise.
Time is of the essence in speech perception! Get it fast, or think about it
Speech recognition occurs when attending to speech stimuli in auditory, visual, or audiovisual modalities under optimum (e.g., in silence) or degraded listening conditions (i.e., in background noise or in individuals with hearing impairment). The present thesis contains details of the first study to show how background noise (steady-state white noise) delayed the identification of different typ...
Audiovisual Lombard speech: reconciling production and perception
An earlier study compared audiovisual perception of speech 'produced in environmental noise' (Lombard speech) and speech 'produced in quiet' with the same environmental noise added. The results showed that listeners make differential use of the visual information depending on the recording condition, but gave no indication of how or why this might be so. A possible confound in that study wa...
Publication date: 2001